112 research outputs found

    Advanced algorithms for audio and image processing

    The objective of the thesis is to develop a set of innovative algorithms around the topic of beamforming in acoustic imaging, audio processing and image processing, aimed at significantly improving the performance of devices that exploit these computational approaches. The context is therefore the improvement of devices already on the market (ultrasound machines and video/audio devices) or the development of new ones which, through the proposed studies, can be introduced on new markets with the launch of innovative high-tech start-ups. This is the motivation and the leitmotiv behind the doctoral work. The first part of the work presents an innovative image reconstruction algorithm for ultrasound biomedical imaging, tied to the development of equipment that exploits the computing power currently offered at low cost by GPUs (Moore's law). The target is a new image reconstruction pipeline that abandons the hardware-based architecture of such equipment. In this first part I address the reconstruction of ultrasound images on a hypothesized software-based device, using image reconstruction algorithms that operate in the frequency domain. An innovative beamforming algorithm based on seismic migration is presented, in which the RF data are transformed and the reconstruction algorithm can apply a mask to the k-space of the data, speeding up the reconstruction process and reducing the computational burden. 
The algorithms were analyzed and developed as a feasibility study, offline and on the Matlab platform, processing both synthetic (simulated) data and real RF data; their subsequent development within future ultrasound biomedical equipment will exploit a high-performance computing framework capable of processing customized kernel pipelines (henceforth called 'filters') on CPU/GPU. The filters implemented concern Plane Wave Imaging (PWI), an alternative method of acquiring the ultrasound image compared with the state-of-the-art standard B-mode, which currently insonifies the sample under examination sequentially through focused beams transmitted by the probe channels. The PWI mode is interesting and opens up new scenarios compared with the usual signal acquisition and processing techniques: it aims to make signal processing in general, and image reconstruction in particular, faster and more flexible, and the substantial increase in frame rate opens up and improves clinical applications. The innovative idea is to introduce a further filter, named the masking matrix, into an offline seismic reconstruction algorithm for ultrasound imaging. The masking matrices can be computed offline from the system parameters, since they do not depend on the acquired data. Moreover, they can be pre-multiplied with the propagation matrices without affecting the overall computational load. Subsequently, the thesis addresses beamforming in audio processing on superdirective linear arrays of microphones. The aim is an in-depth analysis of two main families of data-independent approaches and algorithms present in the literature, comparing their performance and the trade-off between directivity and frequency invariance, which is not yet known in the state of the art. 
The goal is to validate the best algorithm so that, from an implementation perspective, its performance can be verified experimentally and correlated with the characteristics and error statistics. Frequency-invariant beam patterns are often required by systems using an array of sensors to process broadband signals. In some experimental conditions, the array spatial aperture is shorter than the involved wavelengths. In these conditions, superdirective beamforming is essential for an efficient system. I present a comparison between two methods that deal with a data-independent beamformer based on a filter-and-sum structure. Both methods (the first numerical, the second analytic) formulate a convex minimization problem in which the variables to be optimized are the filter coefficients or frequency responses. In the described simulations, I chose a geometry and a set of parameters that allow a fair comparison between the performance of the two design methods. In particular, I addressed a small linear array for audio capture with different purposes (hearing aids, audio surveillance systems, video-conference systems, multimedia devices, etc.). The research activity has been used for the launch of a high-tech device through an innovative start-up in the field of glasses/audio devices (https://acoesis.com/en/). The proposed algorithm has been shown to achieve higher performance than the state of the art of similar algorithms, and additionally makes it possible to connect directivity, or better generalized directivity, to the statistics of the sensors' phase and gain errors, which is extremely important for real and industrial implementations of superdirective arrays. 
Therefore, the method that emerges from the comparison is innovative because it quantitatively links the physical construction characteristics of the array to measurable and experimentally verifiable quantities, making the real implementation process controllable. The third topic addressed is the reconstruction of the Room Impulse Response (RIR) using blind audio processing methods. Given an unknown audio source, the estimation of time differences of arrival (TDOAs) can be efficiently and robustly solved using blind channel identification and exploiting the cross-correlation identity (CCI). Prior blind works have improved the estimate of TDOAs by means of different algorithmic solutions and optimization strategies, while always sticking to the case of N = 2 microphones. But what if we can obtain a direct improvement in performance by just increasing N? In the fourth chapter I investigate this direction, showing that, despite its arguable simplicity, it is capable of (sharply) improving upon state-of-the-art blind channel identification methods based on CCI, without modifying the computational pipeline. Inspired by these results, we seek to warm up the community and the practitioners by paving the way (with two concrete, yet preliminary, examples) towards joint approaches in which advances in the optimization are combined with an increased number of microphones, in order to achieve further improvements. Sound source localisation applications can be tackled by inferring the time differences of arrival (TDOAs) between a sound-emitting source and a set of microphones. Among these applications, one can surely list room-aware sound reproduction, room geometry estimation and speech enhancement. 
Although a broad spectrum of prior works estimates TDOAs from a known audio source, TDOAs can be inferred even when the signal emitted by the acoustic source is unknown, by comparing the signals received at two (or more) spatially separated microphones using the notion of cross-correlation identity (CCI). This is the key theoretical tool not only to make the ordering of microphones irrelevant during the acquisition stage, but also to solve the problem as blind channel identification, robustly and reliably inferring TDOAs from an unknown audio source. However, when dealing with natural environments, such “mutual agreement” between microphones can be compromised by a variety of audio ambiguities such as ambient noise. Furthermore, each observed signal may contain multiple distorted or delayed replicas of the emitted source due to reflections or generic boundary effects related to the (closed) environment. Thus, robustly estimating TDOAs is a challenging problem, and CCI-based approaches cast it as single-input/multi-output blind channel identification. Such methods promote robustness in the estimate from the methodological standpoint, using energy-based regularization, sparsity or positivity constraints, while also pre-conditioning the solution space. Last but not least, acoustic imaging is an imaging modality that exploits the propagation of acoustic waves in a medium to recover the spatial distribution and intensity of sound sources in a given region. Well-known and widespread acoustic imaging applications are, for example, sonar and ultrasound. There are active and passive imaging devices: in the context of this thesis I consider a passive imaging system called Dual Cam, which does not emit any sound but acquires it from the environment. 
In an acoustic image each pixel corresponds to the sound intensity of the source, whose position is described by a particular pair of angles and, when the beamformer can work in near-field as in our case, by the distance on which the system is focused. In the last part of this work I propose the use of a new modality characterized by a richer information content, namely acoustic images, for the sake of audio-visual scene understanding. Each pixel in such images is characterized by a spectral signature, associated with a specific direction in space and obtained by processing the audio signals coming from an array of microphones. By coupling such an array with a video camera, we obtain spatio-temporal alignment of acoustic images and video frames. This constitutes a powerful source of self-supervision, which can be exploited in the proposed learning pipeline without resorting to expensive data annotations. However, since 2D planar arrays are cumbersome and not as widespread as ordinary microphones, we propose that the richer information content of acoustic images can be distilled, through a self-supervised learning scheme, into more powerful audio and visual feature representations. The learnt feature representations can then be employed for downstream tasks such as classification and cross-modal retrieval, without the need for a microphone array. To prove this, we introduce a novel multimodal dataset consisting of RGB videos, raw audio signals and acoustic images, aligned in space and synchronized in time. Experimental results demonstrate the validity of our hypothesis and the effectiveness of the proposed pipeline, also when tested on tasks and datasets different from those used for training. 
Chapter 6 closes the thesis, presenting the development of a new Dual Cam POC from which to build a spin-off, with the assumption of applying for an innovation project for hi-tech start-ups (such as an H2020 SME instrument) for a EUR 50k grant, following the idea of technology transfer. A deep analysis of the reference market, technologies and commercial competitors, business model and FTO of the intellectual property is then conducted. Finally, following the latest technological trends (https://www.flir.eu/products/si124/), a new version of the device (planar audio array) with reduced dimensions and improved technical characteristics is simulated, simpler and easier to use than the current one, opening up interesting new possibilities of development, not only technical and scientific but also in terms of business fallout
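
The cross-correlation identity (CCI) mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes a noiseless single-input/multi-output model in which each microphone signal is the source convolved with that microphone's room impulse response; under that assumption, x1 convolved with h2 equals x2 convolved with h1, whatever the source is. The source samples, the impulse responses `h1` and `h2`, and the helper names are hypothetical toy values chosen for illustration, not data or code from the thesis.

```python
def convolve(a, b):
    """Full linear convolution of two finite sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def direct_path_delay(h):
    """Index of the strongest tap, taken here as the direct-path delay."""
    return max(range(len(h)), key=lambda n: abs(h[n]))


# Hypothetical toy data (assumption): an arbitrary source and two sparse
# impulse responses with direct paths at 1 and 3 samples plus one echo each.
source = [1.0, -0.5, 0.25, 0.8, -0.3]
h1 = [0.0, 1.0, 0.0, 0.3]
h2 = [0.0, 0.0, 0.0, 0.9, 0.2]

# Noiseless microphone signals: x_i = source * h_i.
x1 = convolve(source, h1)
x2 = convolve(source, h2)

# Cross-correlation identity: x1 * h2 == x2 * h1, independent of the source.
lhs = convolve(x1, h2)
rhs = convolve(x2, h1)
cci_residual = max(abs(l - r) for l, r in zip(lhs, rhs))

# With the impulse responses in hand, the TDOA is the difference of the
# direct-path delays, in samples.
tdoa_samples = direct_path_delay(h2) - direct_path_delay(h1)

print("CCI residual:", cci_residual)
print("TDOA (samples):", tdoa_samples)
```

In a blind setting the impulse responses are unknown and are estimated by minimizing this residual under constraints such as those the abstract lists (energy-based regularization, sparsity, positivity); the sketch only checks the identity itself.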

    Effect of green areas on the price of urban land in Curitiba

    Advisor: Alexandre Alves Porsse. Monograph (undergraduate) - Universidade Federal do Paraná, Setor de Ciências Sociais Aplicadas, Curso de Ciências Econômicas. Includes references. Abstract: This work aims to analyze the effect of green areas on the price of urban land in Curitiba and the impact of the distance to certain parks, in order to assess whether there is a positive relationship between land prices and the presence of this urban amenity. The study gathers information on vacant lots and lots with precarious construction, using econometric regression of a hedonic price equation as its method. A brief literature review of urban theory and a brief discussion of the hedonic price model are also presented. With this material, the research provides possible evidence of this relationship and of how it contributes to the formation of land prices in the city of Curitiba. The results indicate that lots located within 100 meters of a green area have an average gain of 12.9% in the price per square meter

    Subjective Fear in Virtual Reality: A Linear Mixed-Effects Analysis of Skin Conductance

    The investigation of the physiological and pathological processes involved in fear perception is complicated due to the difficulties in reliably eliciting and measuring the complex construct of fear. This study proposes a novel approach to induce and measure subjective fear and its physiological correlates combining virtual reality (VR) with a mixed-effects model based on skin conductance (SC). Specifically, we developed a new VR scenario applying specific guidelines derived from horror movies and video games. Such a VR environment was used to induce fear in eighteen volunteers in an experimental protocol, including two relaxation scenarios and a neutral virtual environment. The SC signal was acquired throughout the experiment, and after each virtual scenario, the emotional state and fear perception level were assessed using psychometric scales. We statistically evaluated the greatest sympathetic activation induced by the fearful scenario compared to the others, showing significant results for most SC-derived features. Finally, we developed a rigorous mixed-effects model to explain the perceived fear as a function of the SC features. Model-fitting results showed a significant relationship between the fear perception scores and a combination of features extracted from both fast- and slow-varying SC components, proposing a novel solution for a more objective fear assessment

    Can a Humanoid Face be Expressive? A Psychophysiological Investigation

    Non-verbal signals expressed through body language play a crucial role in multi-modal human communication during social relations. Indeed, in all cultures, facial expressions are the most universal and direct signs to express innate emotional cues. A human face conveys important information in social interactions and helps us to better understand our social partners and establish empathic links. Recent research shows that humanoid and social robots are becoming increasingly similar to humans, both esthetically and expressively. However, their visual expressiveness is a crucial issue that must be improved to make these robots more realistic and intuitively perceivable by humans as not different from them. This study concerns the capability of a humanoid robot to exhibit emotions through facial expressions. More specifically, emotional signs performed by a humanoid robot have been compared with corresponding human facial expressions in terms of recognition rate and response time. The set of stimuli included standardized human expressions taken from an Ekman-based database and the same facial expressions performed by the robot. Furthermore, participants’ psychophysiological responses have been explored to investigate whether there could be differences induced by interpreting robot or human emotional stimuli. Preliminary results show a trend towards better recognition of expressions performed by the robot than of 2D photos or 3D models. Moreover, no significant differences in the subjects’ psychophysiological state have been found during the discrimination of facial expressions performed by the robot in comparison with the same task performed with 2D photos and 3D models

    Age and growth of the pompano, Trachinotus ovatus, from the Strait of Messina (central Mediterranean Sea)

    This is the first paper to provide detailed information on the age and growth of Trachinotus ovatus. The size of the 244 individuals collected in the Strait of Messina ranged from 2.7 to 30.4 cm in fork length (LF) and 0.31 to 508.6 g in body mass (M). The relationship between these parameters (M - LF) was investigated and showed a good fit. Age estimation based on vertebrae and otoliths yielded similar results, suggesting a maximum age of five years. However, the precision and accuracy tests, such as percentage of agreement (PA), mean coefficient of variation (ACV) and average percent error (APE), indicated that the otolith readings (97.83% PA, 0.54% ACV and 0.38% APE) were more reliable for age estimation than vertebrae readings (82.17% PA, 5.33% ACV and 3.77% APE). The multi-model inference approach allowed us to compare different non-linear growth models. The von Bertalanffy model (L∞=29.139, k=0.496 and t0=−0.347) fitted the length-at-age data best. This species has a relatively rapid growth and an estimated longevity of five to seven years. This information could be used for management and first stock assessment studies on T. ovatus in the Mediterranean Sea
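
The von Bertalanffy parameters reported above can be turned into predicted lengths at age. The following is a minimal Python sketch assuming the standard form L(t) = L∞ (1 − exp(−k (t − t0))), with L∞ in cm of fork length, k per year and t0 in years. The abstract gives the parameter values but not their units, so the units and the function name `vbgf_length` are assumptions.

```python
import math

# Parameters as reported in the abstract; units are assumed (see lead-in).
L_INF = 29.139   # asymptotic fork length, assumed cm
K = 0.496        # growth coefficient, assumed per year
T0 = -0.347      # theoretical age at zero length, assumed years


def vbgf_length(age_years):
    """Predicted fork length at a given age under the von Bertalanffy model."""
    return L_INF * (1.0 - math.exp(-K * (age_years - T0)))


# Predicted length for each age class up to the maximum age read (five years).
for age in range(0, 6):
    print(f"age {age} y: {vbgf_length(age):.1f} cm")
```

Predicted length rises quickly in the first years and flattens towards L∞, which is in line with the relatively rapid growth the abstract describes.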

    When opportunistic predators interact with swordfish harpoon fishing activities: shark depredation over catches in the Strait of Messina (central Mediterranean Sea)

    We describe the interaction between harpoon fishing activity and sharks, which opportunistically depredated harpoon catches in the Strait of Messina. Shark bite marks were observed on harpooned swordfish during the period 2014–2020, with varying degrees of damage to the catches. Most of the depredation events involved large swordfish, generally weighing more than 60 kg. Data from direct observations were supplemented by interviews and questionnaires to fishermen, aimed at recovering information on their local fishing and ecological knowledge. Fishermen provided additional data on shark-harpoon fishing interactions, also supplying information on by-catch species (i.e., bluefin tuna). These results suggest that sharks migrating through the Strait of Messina are occasionally attracted by injured prey, due to their ability to detect chemical cues, fish distress stimuli and body fluids (i.e. blood) in the water. In addition, our investigations showed an increase in shark attacks on harpooned fish over time, likely due to an increase in harpoon swordfish catches. This may be related to the effects of the driftnet ban enforced by European Regulations in the last decades

    Remediation of Hg-contaminated marine sediments by simultaneous application of enhancing agents and microwave heating (MWH)

    The aim of this work was to investigate the Hg removal ability of a novel microwave heating (MWH) treatment for marine sediment remediation, enhanced by the application of several agents: a biodegradable complexing agent (methylglycinediacetic acid, MGDA), a surfactant (Tween® 80), and citric acid. Main results revealed that MWH allowed a very rapid heating (∼450 °C in 7 min) of the irradiated medium. However, without the addition of enhancing agents, a maximum Hg removal of ∼72% can be achieved. The application of MGDA led to a higher contaminant removal of ∼87% (residual concentration = 5.4 mg kg−1). For the treatment including the simultaneous addition of both chelating agent and surfactant, their synergetic action and stripping processes resulted in a very high Hg removal of ∼99% for an irradiation time of 7 min, corresponding to a residual concentration of 0.56 mg kg−1, which is lower than the Italian regulatory limit of 1 mg kg−1. The use of citric acid resulted in a shortening of the removal kinetics, which allowed the successful application of a shorter remediation time of 5 min. The observed strong passive ability of sediments to convert microwave irradiation energy into a rapid and large temperature increase undoubtedly represents a key factor in the whole remediation process, making the studied treatment an excellent choice. Kinetic data are suitable for a preliminary assessment of the effectiveness of clean-up activities, and as a basis for future scaling-up studies on MWH of Hg-contaminated sediment
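
The removal percentages and residual concentrations quoted above can be cross-checked with a short calculation. This is a minimal Python sketch assuming removal is defined as (C0 − C_residual) / C0. The abstract does not state the initial Hg concentration, so `c0` below is back-calculated from the MGDA result and is only approximate, because the reported ∼87% is itself rounded. The function names are hypothetical.

```python
def initial_concentration(residual_mg_kg, removal_fraction):
    """Initial concentration implied by a residual and a removal fraction."""
    return residual_mg_kg / (1.0 - removal_fraction)


def removal_fraction(initial_mg_kg, residual_mg_kg):
    """Fraction of contaminant removed, (C0 - C_residual) / C0."""
    return 1.0 - residual_mg_kg / initial_mg_kg


# MGDA treatment: ~87% removal leaves 5.4 mg/kg, which implies C0.
c0 = initial_concentration(5.4, 0.87)

# Chelating agent + surfactant: residual of 0.56 mg/kg against the same C0.
combined = removal_fraction(c0, 0.56)

# Regulatory limit quoted in the abstract: 1 mg/kg.
below_limit = 0.56 < 1.0

print(f"implied initial concentration: {c0:.1f} mg/kg")
print(f"implied combined removal: {combined * 100:.1f}%")
print("below 1 mg/kg limit:", below_limit)
```

Under these assumptions the implied removal for the combined treatment rounds to the ∼99% reported, so the two sets of figures are mutually consistent.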

    Evidence of a predation event on a tagged Mediterranean spearfish (Tetrapturus belone; Pisces, Istiophoridae), inferred from pop-up satellite tagging data

    The Strait of Messina is located at the centre of the Mediterranean Sea and is considered a biodiversity hotspot and an obligatory seasonal passage for different pelagic species such as sharks, marine mammals, and billfishes. For the first time, in the Strait of Messina, our research group tagged a Mediterranean spearfish (Tetrapturus belone) using a pop-up satellite archival tag (PSAT). The observation of abiotic parameters (depth, light, and temperature) recorded by the PSAT confirmed that the tagged specimen was predated after about nine hours. The tag was then regurgitated 14 days after the tag deployment date. The analysis of collected data seems to indicate that the predator may be an ectothermic shark, most likely the bluntnose sixgill shark (Hexanchus griseus)

    miR-340 predicts glioblastoma survival and modulates key cancer hallmarks through down-regulation of NRAS

    Glioblastoma is the most common primary brain tumor in adults, with a survival time of 12 months from diagnosis. However, a small subgroup of patients, termed long-term survivors (LTS), has a survival time longer than 12–14 months. There is thus increasing interest in the identification of molecular signatures predicting glioblastoma prognosis and in how to improve the therapeutic approach. Here, we report miR-340 as a prognostic tumor-suppressor microRNA for glioblastoma. We analyzed microRNA expression in > 500 glioblastoma patients and found that although miR-340 is strongly down-regulated in glioblastoma overall, it is up-regulated in LTS patients compared to short-term survivors (STS). Indeed, miR-340 expression predicted better prognosis in glioblastoma patients. Consistently, overexpression of miR-340 in glioblastoma cells was found to produce a tumor-suppressive activity. We identified NRAS mRNA as a critical, direct target of miR-340: in fact, miR-340 negatively influenced multiple aspects of glioblastoma tumorigenesis by down-regulating NRAS and downstream AKT and ERK pathways. Thus, we demonstrate that expression of miR-340 in glioblastoma is responsible for a strong tumor-suppressive effect in LTS patients by down-regulating NRAS. miR-340 may thus represent a novel marker for glioblastoma diagnosis and prognosis, and may be developed into a tool to improve treatment of glioblastoma